DAOS-18607 object: increase RPC retry latency #18015

Closed
Nasf-Fan wants to merge 2 commits into release/2.6 from Nasf-Fan/DAOS-18607_2

@Nasf-Fan (Contributor) commented Apr 15, 2026

Increase the RPC retry latency to reduce the server load caused by resent I/O requests.

More fixes:

  1. Send the DTX RPC from the current ULT instead of via dss_chore.

  2. Add log messages for the -DER_AGAIN cases.

Steps for the author:

  • Commit message follows the guidelines.
  • Appropriate Features or Test-tag pragmas were used.
  • Appropriate Functional Test Stages were run.
  • At least two positive code reviews including at least one code owner from each category referenced in the PR.
  • Testing is complete. If necessary, forced-landing label added and a reason added in a comment.

After all prior steps are complete:

  • Gatekeeper requested (daos-gatekeeper added as a reviewer).

@github-actions

Errors are: Unable to load ticket data
https://daosio.atlassian.net/browse/DAOS-18607

@Nasf-Fan force-pushed the Nasf-Fan/DAOS-18607_2 branch from bc2e347 to fe43b2c on April 15, 2026 05:01
gnailzenh previously approved these changes Apr 15, 2026
Comment thread src/object/cli_obj.c
	/* Increase delay after multiple times retry. */
	if (*retry_cnt >= 5)
		delay <<= 1;
	if (*retry_cnt > 10 || delay > 3000000)
Collaborator

I think we can simply multiply the delay by 10 on each retry, starting from 10 us, so that after 6 tries we end up retrying every 10 seconds.
Let's first see whether that helps.
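
For illustration, the schedule suggested in this comment can be sketched as follows. This is only a sketch under stated assumptions, not the code from this PR or from the follow-up PRs: the function name `retry_delay_us` and both constants are hypothetical, and the 10-second cap is read from the comment's "each 10 seconds".

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical constants taken from the figures in the review comment. */
#define RETRY_DELAY_BASE_US 10ULL       /* first retry waits 10 us */
#define RETRY_DELAY_MAX_US  10000000ULL /* never wait longer than 10 s */

static_assert(RETRY_DELAY_BASE_US <= RETRY_DELAY_MAX_US,
	      "base delay must not exceed the cap");

/*
 * Hypothetical helper: delay in microseconds before retry number
 * retry_cnt (0-based). The delay is multiplied by 10 per retry and
 * clamped to RETRY_DELAY_MAX_US.
 */
uint64_t
retry_delay_us(uint32_t retry_cnt)
{
	uint64_t delay = RETRY_DELAY_BASE_US;
	uint32_t i;

	/* Stop multiplying once the cap is reached, so delay cannot overflow. */
	for (i = 0; i < retry_cnt && delay < RETRY_DELAY_MAX_US; i++)
		delay *= 10;

	return delay < RETRY_DELAY_MAX_US ? delay : RETRY_DELAY_MAX_US;
}
```

With these assumed values the waits are 10 us, 100 us, 1 ms, 10 ms, 100 ms and 1 s for the first six tries, and 10 s for every try after that.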

Increase the RPC retry latency to reduce the server load caused by resent I/O requests.

More fixes:

1. Send the DTX RPC from the current ULT instead of via dss_chore.

2. Add log messages for the -DER_AGAIN cases.

Signed-off-by: Fan Yong <fan.yong@hpe.com>
@Nasf-Fan force-pushed the Nasf-Fan/DAOS-18607_2 branch from 2d027e0 to ab11981 on April 16, 2026 13:43
@Nasf-Fan
Contributor Author

This patch is only for testing and carries some debug information. If it proves useful, we will land the related fixes via separate, clean PRs:
master: #18032
release/2.6: #18036

@mchaarawi
Contributor

If this PR is no longer needed, please close it and delete the branch.

@Nasf-Fan
Contributor Author

Patch #18036 has already landed on release/2.6, so we no longer need this one.

@Nasf-Fan closed this Apr 22, 2026

3 participants